# Cross-modal generation
Show O2 7B
Apache-2.0
Show-o2 is an improved native unified multimodal model that combines autoregressive modeling with flow matching to support both understanding and generation across text, images, and video.
Text-to-Image
showlab
198
6
Ming Lite Omni
MIT
A lightweight unified multimodal model that efficiently processes images, text, audio, and video, and performs strongly in speech and image generation.
Multimodal Fusion
Transformers

inclusionAI
4,215
103